Rank in Wordlist | Frequency | Word |
---|---|---|
7706 | 23 | 2,5 |
8569 | 20 | 4,5 |
8900 | 19 | 0,3% |
8902 | 19 | 1,5 |
9281 | 18 | 0,5% |
9282 | 18 | 0,7% |
9289 | 18 | 3,5 |
10556 | 15 | 1,2 |
11096 | 14 | 0,2% |
11098 | 14 | 1,1 |
Rank in Wordlist | Frequency | Word |
---|---|---|
50234 | 2 | la(s |
54638 | 2 | «(… |
64805 | 1 | Devesa-Güell'08(va |
66232 | 1 | F(a)usta |
66381 | 1 | Faith'(Fe |
66914 | 1 | Fotògraf(e)s |
67289 | 1 | Game(Desxifrant |
68784 | 1 | Imatge(CRDI |
74213 | 1 | Ordre(s |
76958 | 1 | Rwanda(1994 |
Rank in Wordlist | Frequency | Word |
---|---|---|
18179 | 7 | %) |
39560 | 2 | Alemanya)VilaWeb |
44429 | 2 | a)phònica |
54968 | 1 | %), |
54969 | 1 | %). |
55241 | 1 | 1-1)Davant |
55616 | 1 | 11%).’ |
56361 | 1 | 1870). |
56372 | 1 | 1892),. |
56445 | 1 | 1927-2002)Poso |
Rank in Wordlist | Frequency | Word |
---|---|---|
2221 | 98 | 50% |
2350 | 93 | 10% |
2785 | 77 | 100% |
2786 | 77 | 20% |
3069 | 69 | 80% |
3174 | 67 | 25% |
3175 | 67 | 30% |
3598 | 58 | 3% |
3810 | 54 | 40% |
4011 | 51 | 70% |
Rank in Wordlist | Frequency | Word |
---|---|---|
59387 | 1 | AT&T |
59621 | 1 | Aerobic&Fitness |
60234 | 1 | Antic&Design |
60833 | 1 | BG&A |
64480 | 1 | Danger&Chikita |
65017 | 1 | Dolce&Gabbana |
66231 | 1 | F&B |
66710 | 1 | Fit&Sit |
66792 | 1 | Foam&Diamonds |
67372 | 1 | Gas&Power |
Rank in Wordlist | Frequency | Word |
---|---|---|
58520 | 1 | 60$ |
Rank in Wordlist | Frequency | Word |
---|---|---|
54967 | 1 | %" |
Rank in Wordlist | Frequency | Word |
---|---|---|
78 | 2191 | d'un |
85 | 2006 | s'ha |
89 | 1904 | d'una |
176 | 974 | l'any |
189 | 917 | s'han |
229 | 795 | d'aquest |
278 | 644 | l'Ajuntament |
298 | 617 | d'aquesta |
335 | 558 | l'equip |
399 | 485 | s'hi |
Rank in Wordlist | Frequency | Word |
---|---|---|
3148 | 68 | de/d |
4405 | 46 | 2/4 |
7423 | 25 | regio7Anoia/Baix |
8466 | 21 | i/o |
8476 | 21 | km/h |
10572 | 15 | 3/4 |
11715 | 13 | 1/4 |
20195 | 6 | 24/2015 |
20431 | 6 | EDIZIONES/Portaltic |
21651 | 6 | g/l |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $
" ' + * = / _
Depending on the language some of these characters may be allowed within words, other will not. If words with forbidden characters do not have very low frequency there might be a problem in preprocessing.
Words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots